Compressed Vision for Efficient Video Understanding

نویسندگان

چکیده

Experience and reasoning occur across multiple temporal scales: milliseconds, seconds, hours or days. The vast majority of computer vision research, however, still focuses on individual images short videos lasting only a few seconds. This is because handling longer require more scalable approaches even to process them. In this work, we propose framework enabling research hour-long with the same hardware that can now second-long videos. We replace standard video compression, e.g. JPEG, neural compression show directly feed compressed as inputs regular networks. Operating improves efficiency at all pipeline levels – data transfer, speed memory making it possible train models faster much Processing signals has, downside precluding augmentation techniques if done naively. address by introducing small network apply transformations latent codes corresponding commonly used augmentations in original space. demonstrate our pipeline, efficiently popular benchmarks such Kinetics600 COIN. also perform proof-of-concept experiments new tasks defined over frame rates. long impossible without using representation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Compressed-domain Object Detection for Video Understanding

In this paper, a novel algorithm for the real-time, unsupervised object detection in compressed-domain sequences is proposed. The algorithm utilizes color and motion information present in the compressed stream as well as a simple object model. Extraction of the MPEG-7 dominant color descriptor, clustering of macroblocks to dominant color clusters and model-based cluster selection are employed ...

متن کامل

An Efficient Watermarking Scheme for H.264/avc Compressed Video

Since H.264/AVC is the most widely-deployed video coding standard and has gained dominance, the necessity of copyright protection and authentication that are appropriate for this standard is unquestionable. According to H.264/AVC specific codec architecture, an efficient watermarking scheme for H.264/AVC video is proposed. The watermark information is embedded into quantized residual coefficien...

متن کامل

Cognitive vision systems for video understanding and retrieval

متن کامل

An Efficient Adaptive Boundary Matching Algorithm for Video Error Concealment

Sending compressed video data in error-prone environments (like the Internet and wireless networks) might cause data degradation. Error concealment techniques try to conceal the received data in the decoder side. In this paper, an adaptive boundary matching algorithm is presented for recovering the damaged motion vectors (MVs). This algorithm uses an outer boundary matching or directional tempo...

متن کامل

Video Abstraction in H.264/AVC Compressed Domain

Video abstraction allows searching, browsing and evaluating videos only by accessing the useful contents. Most of the studies are using pixel domain, which requires the decoding process and needs more time and process consuming than compressed domain video abstraction. In this paper, we present a new video abstraction method in H.264/AVC compressed domain, AVAIF. The method is based on the norm...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-26293-7_40